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Abstract 

We consider the edge-based compartmental models for epidemic spread developed in [5]. We show 
conditions under which simpler models may be substituted for more detailed models, and in so doing we 
define a hierarchy of epidemic models. In particular we provide conditions under which it is appropriate 
to use the standard mass action SIR model, and we show what happens when these conditions fail. 
Using our hierarchy, we provide a procedure leading to the choice of the appropriate model for a given 
population. Our result about the convergence of models to the Mass Action model gives clear, rigorous 
conditions under which the Mass Action model is accurate. 

1 Introduction 

The spread of epidemics through populations is affected by many factors such as the infectiousness of the 
disease, the duration of infection, the distribution of contacts through the population, and the typical 
duration of contacts. Typically for predicting epidemic spread or intervention effectiveness, we want an 
accurate, but simple, model that captures the relevant effects. 

In our earlier work [5], we introduced the edge-based compartmental modeling approach for the spread 
of SIR diseases in populations with different contact dynamics. In particular we showed that edge-based 
compartmental models can capture contact duration and social heterogeneity (variation in contact levels) 
simultaneously in mathematically and conceptually simple terms. The models we studied all assumed that 
the population was made up of individuals who were identical except for their contact levels. We modeled 
the population as a network, with nodes representing individuals joined by edges representing potentially 
transmitting contacts. We also assumed a simple disease, with transmission occuring at rate (3 per edge and 
recovery occuring at rate 7. In this paper we investigate the relationships between models and how to choose 
the simplest model appropriate for a given population. 

The models discussed in [5] can each be classified by two features: the first is how contact duration is 
determined and the second is how contact levels are distributed through the population. Table [l] summarizes 
the models and their underlying assumptions. 

The contact duration falls into three possibilities: it can be permanent, finite, or fleeting. In the per- 
manent case, a contact that exists at any time has always existed and will always exist. In the finite case 
contacts may change over time. In the fleeting case, contacts are so brief that over any macroscopic time 
scale an individual samples a very large number of neighbors, and so it is safe to assume that the total 
contact time with infected individuals matches its expected value. 

The distribution of contact levels can be split into two types. In one set of models (expected degree 
models), we assign an expected degree k to a node. The probability that an edge exists between two nodes 
is proportional to the expected degrees of each node. Edges are created independently of one another, so 
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Contact Duration 


Heterogeneity type 


Configuration Model (CM) 


Permanent 


Actual degree k 


Mixed Poisson (MP) 


Permanent 


Expected degree k 


Dynamic Variable-Degree (DVD) 


Finite 


Expected degree n 


Dynamic Fixed-Degree (DFD) 


Finite 


Actual degree k 


Mean Field Social Heterogeneity (MFSH) 


Fleeting 


Contact rate k or k 


Dormant Contact (DC) 


Finite active and dormant 


Maximum degree k m 


Mass Action (MA) 


Fleeting 


None 



Table 1: The basic models and their underlying assumptions. Contacts are assumed to form and break at 
some rate which can vary from zero (permanent) to infinite (fleeting). Depending on the process governing 
contact formation, we may know the actual degree k of an individual or we may know its expected degree. 

the existence of an edge between u and v does not alter whether an edge can exist between u and w. Note 
that the expected degree can take any non-negative real value, and we assume that the distribution of k is 
given by the probability density function p{n). In the other set of models (actual degree models) we assign 
an actual integer number of contacts fc, the degree, to each individual though in the case of the dormant 
contact model not all of these contacts must be active at all times. We think of an individual as having k 
stubs (or half-edges) which pair randomly with stubs of other nodes to form edges. In this case the existence 
of an edge between u and v removes an available stub from it, and so it affects the probability of an edge 
between u and w. The distribution of degrees is given by the probability mass function P(k). 

The distinction between the expected degree models and the actual degree models becomes apparent 
when we calculate the probability that an individual is susceptible. For the expected degree models, there is 
a continuum of risk levels and so we will have to calculate the per-expected degree probability of not having 
been infected. In contrast for the actual degree models the risk is discretized. The calculation is slightly 
different in each case, but the underlying concepts arc the same. 

The equations produced by edge-based compartmental models are surprisingly simple. Nevertheless the 
cases with simpler assumptions lead to simpler equations, and so it is worth knowing what conditions allow 
the use of a simpler model. In general when contact duration is short compared to the infection and recovery 
time scales it is safe to use the fleeting contact models, while if contact duration is long compared to the 
duration of the epidemic it is safe to use the permanent contact models. However, there are some less 
obvious limits. If the average degree (K) is large while the rate of transmission per edge scales such that (3 
is of order 1 / (K) and the recovery rate 7 is fixed, then the probability that any given edge transmits even 
once is small. The probability it transmits twice is negligible: the disease "sees" an edge at most once, so 
whether it is permanent, fleeting, or finite has no impact on disease spread. Thus we may treat the model 
as if the contacts are fleeting, which simplifies the equations. If further the contact distribution is such that 
the contact levels are generally close to the mean, then we can neglect variation in contact levels, and turn 
to the simple mass action model of 0^ . The precise condition required for this is somewhat technical: we 
must have | (K 4 ) - (K) 4 \/ (K) 4 < 1 and (K) > 1 with /3 = /3 (K) fixed. When this docs not hold, there 
can be a significant deviation away from the mass action model. 

In this paper we begin by providing a flow chart leading to selection of the appropriate model for a given 
population. Following the flow chart, we introduce the hierarchy of the models which underlies the flow 
chart. We describe the precise assumptions of the models and demonstrate some of the simpler parts of the 
hierarchy. After that, we consider some of the more difficult aspects of the hierarchy. We finally discuss some 
of the implications and limitations of our approach. In particular, we note that we give a simple heuristic for 
when the Mass Action equations are appropriate. In the Appendix we provide more rigorous justifications 
for the claims in the hierarchy section. Because of its importance, we include the rigorous proof of conditions 
under which the Mass Action equations hold in the main text. 
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• Ql: Is the average degree (A) large and does the probability that an infected node recovers before 
transmitting scale like 1/ (A)? 

or 

Is an existing edge from an infected neighbor much more likely to be deleted than to have infection 
transmit? 

• Q2: Do almost all contacts stay the same during the time scale of the epidemic? 

• Q3: Is | (A 4 ) — (A) 4 |/ (A") 4 <C 1 where (A 4 ) is the average fourth power of the degree and (A) is the 
average degree? 

• Q4: Can we measure the actual number of contacts an individual has, or just the expected number? 

• Q5: Does a new contact only form when an existing contact breaks? 

• Q6: For a given individual, are new contacts created at a constant rate regardless of the existing 
number of contacts? 

Figure 1: Flow chart for model choice. Depending on the how the edge turnover rates compare to the 
infection and recovery rates, as well as the degree distribution, we arrive at different models. The model 
abbreviations are as in Table [T] This flow chart assumes that all structure in the population is due to 
heterogeneity in contact levels. 



2 Model Selection 

In Figure [I] we present a flow chart that can be used to select the appropriate model for a given population. 
The conditions depend on the degree distribution and the rate at which edges change. These are relatively 
straightforward to measure for a population [§1 [TSJ [TT] . From the observations of population and disease 
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parameters, we can choose the simplest model to accurately represent disease spread in a given population. 
The equations for each model are described in Section [3] 

Most existing modeling of infectious disease spread are based on the Mass Action model [2]. This flow 
chart gives appropriate conditions under which this model is reasonable. In general, we must have most 
degrees close to the average degree, but at the same time either contacts must have high turnover or the 
probability of transmission per contact is very low. Under these conditions, it is appropriate to use the MA 
model. 

Before using this flow chart it is always prudent to be sure that the population does not violate other 
assumptions of the models. For example, different contact structure between age groups may require more 
consideration. If there are important features not captured by these models it may be possible to develop 
a custom model that captures the relevant detail (see [5J [T2] ) . Otherwise we may not be able to rely on 
edge-based compartmental models and may have to use simulation. 

The questions asked are somewhat vague in the sense that whether a number is large or not is somewhat 
a matter of opinion. In Section [3] we address this in more detail. We show how the results of the models 
converge as the parameter values change. 

3 The model hierarchy 

In this section we investigate the hierarchy of Figure [2] underlying the flow chart in Figure [l] We first give 
a brief overview of the standard simple model, the Mass Action model. We then consider the hierarchy 
of models. We find it convenient to consider the expected degree models before the actual degree models. 
We consider the models roughly in order of increasing complexity, explaining the underlying assumptions 
and giving the equations from [5 . If we can reduce the model to another model, we explain why this 
should happen and give some details of the mathematical explanation. More complete derivations are in the 
Appendix. 

The Mean Field Social Heterogeneity models require further attention. We have two formulations, one 
in terms of expected degree n and the other in terms of actual degree k. We will not address this initially 
while we discuss the main structure of the hierarchy, but at the end of this section we show that the two 
formulations are mathematically equivalent in the sense that each can be derived from the other. So we 
do not distinguish between the two models in the hierarchy of Figure [2] We will also show that all other 
models reduce to Mean Field Social Heterogeneity in the large (K) limit if (K) and 7 are both constant. 
In turn we have conditions under which the Mean Field Social Heterogeneity model reduces to the Mass 
Action model. Given any model with (K) and 7 fixed, (K) — > 00 and (K 4 ^ / {K) A —> 1, we arrive at the 
Mass Action modeQ 

3.0.1 The Mass Action model 

The Mass Action (MA) model assumes that all individuals have the same rate of contact formation k (or k) 
and contacts are sufficiently short that we may neglect repeated contacts. The transmission rate per contact 
is 0, and so the combined transmission rate is = 0k. If we take S, I, and R to be the proportion of the 
population that is infected, this leads to 

S = -0IS , I = 0IS-jI, R = jl 

However, this may be simplified somewhat by noting that S + 1 + R = 1. We replace the equation for / with 
I = l-S-R 

S = -0IS, I = l-S-R, R = jl 

1 It is worth noting that in the expected degree formulations the average fourth power of the degree and the average fourth 
power of the expected degrees are not equal, but if the ratio with (K) 4 tends to 1 for either, it does for the other as well. 
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Figure 2: The hierarchy structure. The edges summarize the relations between the models. A solid arrow 
from one model to another represents that the target model can be derived as a limiting case of the base 
model. We give heuristic explanations in the text for these and more rigorous derivations in the Appendix. 
A dashed arrow denotes that the target model can be derived as a special case of the base model. The 
dashed arrows are straightforward and the justification is given in the main text. 
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3.1 Expected degree models 

We now study the expected degree models for which each node has an expected degree k assigned using 
the probability distribution function p(n). We allow k to be a continuous variable. At any given time, 
the probability that two nodes u and v share an edge is proportional to k u k v , and each edge is assigned 
independently of all others. 

We briefly sketch the approach used to derive equations in these networks. Full details are in [5]. We 
define as a function of time such that 1 — represents the per-unit k probability of having been infected. 
To be precise let u and v be nodes with and Ak <C 1. If the probability that u is susceptible is 

s(k u , t), then the probability that v is still susceptible is s(k u + Ak, t) = [1 — (1 — Q)Ak]s(k u , t) + 0(Ak 2 ). 
Then taking Ak — > 0, we have ds/dn = (1 — Q)s. So s(k, t) — exp[— k(1 — 6)] and the probability a random 
node is susceptible is S(t) = \I/(0(i)) where 

poo 

Jo 

The difference in the various expected degree models is in how long edges last: they may be permanent, 
fleeting or finite. We begin by considering permanent edges. 

3.1.1 Mixed Poisson 

In the Mixed Poisson (MP) model, the population is assumed to be static, so that if a contact ever exists, 
then it has always existed and will always exist. The governing equations are 

e = -^e + /3^ i i + 7 (i-0) 

R = jl, S = *(Q), I = l-S-R 

3.1.2 Mean Field Social Heterogeneity [expected degree formulation] 

We now consider the opposite limit where contacts are fleeting. The Mean Field Social Heterogeneity (MFSH) 
model [TJ |31 IH [HI IIH] generalizes the MA model by allowing for variations in contact rate among the people. 
At any given time the node is expected to have k contacts, but those contacts change over rapidly. The 
governing equations are 

R = jl, S = V(Q), 1 = 1- S-R 

Note that these equations differ in only one term from the MP equations. 

The only difference in the assumptions of the MFSH model and the MA model is that the MA model 
assumes all contact rates are the same. Indeed, if all expected degrees are the same k, then ^(O) = 
exp[-/t(l - 6)]. We set R = 7 (1 - 6)//3, 1=1- S-R, and S = *(6). We first note that with this *(6), 
we find 9 = —j3I. Setting f3 = /3k and taking the time derivatives of S and R, we see that S = —$IS and 
R = 7/. Thus we have arrived at the MA equations. 

More generally we expect that if the variation in contact rate is sufficiently small, the MFSH model 
should behave like the MA model. In Section [3. 3. 31 we discuss this further. 

3.1.3 Dynamic Variable-Degree 

In the Dynamic Variable-Degree (DVD) model, an individual may create or delete edges at any time. A 
node with expected degree k creates edges at rate ktj. Any existing edge breaks at rate 77, so a node with 



G 



expected degree k will on average have k edges, though the value fluctuates. The governing equations are 



e = 




R = 



Til/, 



n s = *'(e)/*'(i) , 
s = *(e), / 



l-S-R. 



Ilr = 1 - U s - n R , 



The new variables II/, and Hr give the probabilities that a newly formed edge will connect to a 
susceptible, infected, or recovered node respectively. 

If the changeover rate is sufficiently fast we anticipate that the model should reduce to the MFSH model. 
This statment can be made precise by considering a susceptible node u. The node u is constantly creating 
new edges and breaking existing edges. Its risk of infection depends on how many infected neighbors it 
has. If an edge to an infected neighbor is likely to break before the neighbor transmits, then there is little 
correlation between the risk to node u at different times: knowing that a neighbor has not infected u yet 
says little about its current status and so the duration of contact becomes negligible. In mathematical terms 
this means if i]/(3 is large, we anticipate that the DVD model reproduces the MFSH model. This is made 
more precise in the Appendix. 

In the opposite limit, we would expect that having small 77 leads to an effectively static network, so the 
MP model should result. In fact this is true, but the precise condition is somewhat more subtle than might 
be anticipated. It is not enough that r]/j3 and 77/7 be very small because the epidemic can last for many 
generations. In practice the static model will work well at early times, but may fail at later times as the 
contact structure accumulates changes. We set to to be a time early enough that the number of infections 
by time t is very small, but large enough that shortly thereafter the total number of infections is no longer 
negligible. At some later time t the MP model will be reasonable if r](t — t ) is small. A more precise 
condition that the MP model is reasonable if r](Hi + Ujij/r <C 1 where r is the early exponential growth 
rate is also described in the Appendix. 

We show convergence of the DVD model to the MP and MFSH heterogeneity model as 77 — > or 77 — > 00 
in figure [3] The technical mathematical details showing this convergence are in the Appendix. 

3.2 Actual degree models 

For our second class of models, we assume that each node has an actual degree k assigned using the probability 
mass function P(k). The degree must be a non- negative integer. Each individual is given k stubs, and at any 
time those stubs may join in pairs with stubs of other nodes to form edges. In most models we assume that 
the stubs are always in pairs (though the partner may change), so the degree is k. In the Dormant Contact 
model, we allow stubs to be active or dormant, and thus take k m [distributed according to P{k m )\ to be the 
maximum degree of a node, with k a and kd the active and dormant degrees respectively k a + k^ = k rn . We 
use 9(t) to denote the probability that a stub has not transmitted infection from a neighbor to its node by 
time t. The probability a node with a given k is susceptible is 9 k , and taking a weighted average over all k, 
we find S(t) = ip(9{t)) where 



3.2.1 Configuration Model 

The Configuration Model (CM) networks are similar to the MP networks. In a CM network, the exact 
degree of an individual is assigned. A node is given k stubs, assigned using the probability mass function 
P(k). Once all stubs are assigned to nodes, stubs are randomly paired into edges. The resulting network is 
static. We define 6 to be the probability that the neighbor along a random stub from u has not transmitted 
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Figure 3: Convergence of DVD to MP and MFSH models. We consider the DVD model with varying 
values of rj. The probability density function for the expected degree is p(n) = e K /[exp(10) — 1] for n e (0, 10) 
and otherwise. This yields 9(x) = [1 - exp(-10(l - x))]/[10(l - a:)]. We take j3 = 0.2 and 7 = 1. As 
rj decreases (top), the MP model results, while as r\ increases (bottom) the MFSH model results. Note the 
difference in axes from top to bottom. 

infection to u (so 1 — 9 is the probability the neighbor has transmitted). We find that 

it = 7I , S = ip(9) , I = l-S-R 

This is similar to the MP model. In fact, it is possible to show that a MP network is a special case of the CM 
networks. In MP networks, a node u with expected degree k u has its actual degree chosen from a Poisson 
distribution of mean k u (in the limit of a large network). Thus in an MP network the probability a node has 
degree k is P{k) = f™[e~ K K k /kl]p(n) dn. Thus ^(a;) = ^ k P{k)x k = J °° e~ K (J2 K k x k /k\)p(n) dn = 9(x), 
and so the MP model emerges as a special case of the CM model. 

3.2.2 Mean Field Social Heterogeneity [actual degree formulation] 

In the actual degree version of Mean Field Social Heterogeneity (MFSH), each individual has k stubs. Stubs 
change neighbors quickly so that the neighbor at any given time has no bearing on who the neighbor is later. 
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The governing equations are 

ip'(l) 

R = -fI, S = ip(6), I = l-S-R 

Using techniques similar to those for the expected degree formulation of the MFSH model, we can show 
that if all degrees are the same, then this reduces to the MA model. Similarly, if the degrees arc sufficiently 
close to the mean degree in the sense that (K 4 ^ / (K) 4 — > 1 and both 7 and f3 (K) are fixed, then the solution 
again converges to that of the MA model. In section [3.3. 1| we show that in fact this model is equivalent to 
the expected degree formulation of the MFSH model. 



3.2.3 Dynamic Fixed-Degree 

In the Dynamic Fixed-Degree (DFD) model a node is given k stubs, which are paired with stubs of other 
nodes into edges. As time progresses, an edge may break, and the freed stubs immediately form edges with 
stubs from other edges that break at the same time, a process we refer to as "edge swapping". The rate any 
edge breaks is 77. The resulting equations are 



ib" (6) 
ip'(6) 
ib" (0) 

WW 

R = jl, S{t)=i>{6), I(t) = l-S-R. 

The new variables (f>s and </>/ represent the probability that a stub has not transmitted infection from any 
neighbor to its node and is currently connected to a susceptible or infected neighbor respectively. The 
variables were not needed in the earlier models because we can solve for them explicitly in those cases. The 
new variables Tts, 717, and ttr represent the proportion of all stubs that belong to susceptible, infected, or 
recovered nodes. A newly formed edge connects a node to a susceptible, infected, or recovered node with 
probabilities its, ttj, and ttr respectively. 

The DFD model plays the same role in the actual degree case that DVD model played in the expeted 
degree case. It experiences similar limiting behavior. If r]/f3 is large, we recover the MFSH model. Alternately 
the CM is an accurate approximation so long as rj(t — 1 ) is small where to is a time around when the epidemic 
begins to infect significant numbers. Again, a more precise condition that r/(iri + Tr^)/r <C 1 where r is the 
early exponential growth rate is described in the Appendix. 

Figure [4] shows the convergence of this model to the CM and MFSH models as 77 — > or 77 — > 00. 



3.2.4 Dormant Contacts 

We finally move to the Dormant Contact (DC) model which captures all the previous expected and actual 
degree models as limiting cases. In the DC model, each node is given k m stubs [with k rn chosen from P(k m )]. 
However, only a fraction of them are active. At any given time, the node will have k a active stubs and kd 
dormant stubs, so k m = k a + kd is the maximum number of active stubs. Active stubs become dormant 
at rate r] 2 and dormant stubs become active at rate 771. We take ip{x) = J2k m P{k m )x km ■ The governing 
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Figure 4: Convergence of DFD to CM and MFSH models. We consider the DFD model with 
varying values of r\. The degrees are k = 6, k = 8, and k = 10, with probability 1/3 each. This yields 
ip{x) — (x e + x 8 + x 10 )/3. We take /9 = 0.2 and 7 = 1. As 77 decreases (top), the CM model results, while 
as 77 increases (bottom) the MFSH model results. Note the difference in axes from top to bottom. 

equations are 



<Ps = -p<t>i<ps + vi—<pd - m<Ps 

31 i ( t>s ^ L }n} +V1—0D - (V2 + P + 7)0/ 



if/ (6) ' 7T 

772(6* - <j) D ) - r)i4> D 



ip (1) 

7TR = -72Ci7 -m 7r R+ JTTl , ITS = 4>D ~ 77TT ! "7 = 7T - 7TS - 7Tfl 



>'(1) 



»7i ... % 



R = jl, S = 1/3(6), I=l-S-R 

The new variable </>d represents the probability that a stub has not transmitted infection to its node and is 
currently dormant. The variables ns, ki, and ttr now measure the proportion of all stubs which are both 
dormant and belong to a susceptible, infected, or recovered node, with n = tts + wi + ttr = Vz/ivi + V2) the 
probability a stub is dormant. The ratios 775/71", 717/71", and ttr/tt give the probabilities that a newly formed 
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Figure 5: Convergence of DC to DFD, DVD, and MFSH models. We take (3 = 7 = 1. (top) We 
consider the convergence of the DC model to the DFD model as 771 increases. We take ip(x) — (x 2 +x s )/2. For 
the DC model we take 772 = 1 and vary 7^. As 771 increases, the DC model converges to the DFD model result 
for 37 = 1. (middle) We consider the convergence of the DC model to the DVD model as r/i decreases and the 
degrees increase to compensate. For the DC model, we have ip{x) — (x 1 ^ + x 4 ^)/2 where £ = 771/(771 + 772). 
We use 772 = land vary 771. For the DVD model, we take 77 = 1 and ^(rr) = (exp[— (1 — x)]+exp[— 4(1 — x)])/2. 
(bottom) We consider the convergence of the DC model to the MFSH model with 771 = 772 and 771,772 — > 00. 
We take ifi(x) = (x 2 + x s )/2. The MFSH model has the same ip, but uses /3£ = (3/2 as the transmission rate. 
Note that for all three calculations, the 77! = 1 curves are the same. 



edge connects to a susceptbile, infected, or recovered node. The variables £s, £/, and £r give the probabilities 
that a stub is active and belongs to a susceptible, infected, or recovered node, with £ = £g + £j + £r = 
771/(771 + V2) the probability a stub is active. 

It is relatively straightforward to see that if 77! is much larger than 772, then the proportion of time a stub 
is dormant is tiny, tt <C 1. Consequently at any moment a node is expected to have k a = (1 — Tr)k m k m 
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active contacts. As 772/771 shrinks, this approximation improves. So in this limit, the DC model should 
reduce to the DFD model with edges breaking at rate 772. Indeed, in the equations above, if we take 77 = 772 
and assume 771 3> 77, then is negligibly small, so the <p£> terms drop out of the <ps and <f>i equations. 
The values of 7Tg and 717 both go to zero as 771 grows, but a little more care shows that its /ft approaches £5 
and ttj/t: approaches and the £ variables solve the same equations in this limit as the 7r variables in the 
DFD equations. Thus the DC equations reduce to the DFD equations in this limit. More details are in the 
Appendix. 

In the opposite limit, if 772 is large compared to 771, then stubs spend most time dormant, ( < 1. If 
the number of stubs is sufficiently large, the expected number of active stubs k = E[A Q ] = £fc m will not be 
negligible. The variation in the number of active stubs will be significant relative to its expected value k. 
However, k <C k m so the number of dormant stubs kd = k m — k a can be approximated as fc m 7r. The rate 
that dormant stubs become active is 771, so the rate at which the node forms new edges is approximately 
Vi^d — Vik-mTt which itself is 772ft, while each existing edge breaks at rate 772. This is the assumption 
underlying the DVD model. A careful analysis of the equations show that indeed we can reduce them to the 
DVD model if 772 3> 771 . More details are in the Appendix. 

Alternately, we can have the DC model converge to the MFSH model if (K) — > 00, as long as f3 (K) remains 
fixed. In this case we find that the MFSH model is a good approximation, using /3 = /3£ as the transmission 
rate. The underlying argument of this is that as (K) increases, the probability of transmission per edge 
decreases. Consequently, it becomes unimportant whether the edge has long or short duration because it is 
incredibly rare for the disease to try to transmit along the same edge twice. The total infectiousness of an 
individual is then given by the number of active edges times the per-edge infection rate. We can assume 
that only a proportion £ of the stubs are active at any time, and we can absorb this into j3. So an individual 
with k stubs causes transmission at rate j3k where (3 = £/?. This is independent of how (or if) the edges are 
changing in time. 

Figure [5] shows how the DC model reduces in the various limits. 

3.3 Further Analysis of Mean Field Social Heterogeneity models 

In this section we show that the two MFSH models are equivalent. We also show that all the non-Mass 
Action models described here reduce to the MFSH model if the average number of contacts is large, and the 
probability of infection per edge scales like 1/ (K). Technically, our results show that there is no difference 
between the results of any of the (non-MA) models in this limit, so any of these models would be appropriate, 
but generally the equations of the MFSH model are simplest so we use it. 

3.3.1 Equivalence of Mean Field Social Heterogeneity formulations 

There are two formulations of the MFSH model, one with expected degrees and the other with actual degrees. 

In the expected degree formulation, each individual has an expected degree k. On average, at any given 
moment of time the individual has n contacts. The exact number of contacts may vary from moment to 
moment, but how many contacts exist at one moment and who those contacts are with are independent 
from any other moment. Thus over a short period of time, we may safely assume that the total number of 
contacts with infected individuals matches the expected number. 

In the actual degree formulation, each individual is assigned an actual degree k and given k stubs. At 
any time each of those k stubs forms an edge with a stub from another node, but who the stub connects 
to changes rapidly. Again, over a short period of time, we may assume that the number of contacts with 
infected individuals matches the expected number. 

Actual degree formulation as a special case of expected degree formulation Because contacts 
are very short, over any time interval, the total contact time is well- approximated based on the expected 
amount of contact at any given moment. Thus whether the individuals have the same number of contacts at 
each moment, or whether the amount varies from moment to moment, the effect over any macroscopic time 
interval is the same. Thus the actual degree model should be a special case of the expected degree model. 
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Figure 6: Convergence of actual degree formulation of MFSH to expected degree formulation. 

The actual degree formulation of the MFSH model for the appropriate discrete distribution (described in 
text) converges to the expected degree formulation of the MFSH model with contact rates chosen uniformly 
from to 10. 

In the expected degree formulation, the probability that a node with k expected contacts is susceptible is 
exp[— k(1 — ©)] and in the actual degree formulation the probability that a node with k contacts is susceptible 
is 9 k . If « = k, we expect these to equal, and so we anticipate 9 — exp(0 — 1), or 6 = 1 + In 9. Plugging 
this into the expected degree equations, we arrive at the actual degree equations. So the two methods are 
equivalent subject to a change in variables. 

Expected degree formulation as a limiting case of the actual degree formulation The fact that 
the actual degree formulation leads to the expected degree formulation is based on similar reasoning. The 
underlying additional idea is that any continuous distribution can be approximated by a sufficiently well- 
refined discrete distribution. We then need a way to take a well-refined discrete distribution and rescale it so 
that all the probability is massed at integer values. To do this we make the observation that we can multiply 
everyone's (expected or actual) degree by L with no impact on the epidemic so long as we also divide the 
transmission rate by L. 

To make this more precise, consider a given (3 and p(n) and assume L is large. We approximate the contin- 
uous distribution of k using a discrete distribution where the probability of k/L is J^J^ d«. Increas- 
ing L gives a finer scale approximation. We then multiply every k/L by L, to get P(k) = J^/J p( K ) d« 
where k is an integer. So long as we divide j3 by L, the spread of the epidemic on a population with the 
given P(k) and the rescaled f3 will closely approximate the epidemic spread in the original expected degree 
population, with the approximation improving as L oo. We find that the actual degree equations converge 
to the expected degree equations as L — > oo. Full details are in the Appendix. 

As an example we take the uniform distribution from to 10 for k. So ^(x) = (1 — exp[— 10(1 — 0)])/[2(l — 
6)]. We initially start with a discrete distribution on the integers through 9 with weight 1/10 on each. So 
our initial ip(x) is Y^k=o a;fc /10 = ( xW ~ 1)/10(se — 1) with fj = f3. We then refine the distribution. For given 
L we have ip(x) = Y^k^o 1 x k /10L = (x 10L — l)/[10£(x — 1)], and we take /3 = P/L. Convergence is shown 
in figure [6] 

3.3.2 Reduction of all models to Mean Field Social Heterogeneity model at large average 
degree 

One particularly important limit corresponds to people having many contacts, but a low probability of 
transmitting per contact before recovering. That is, (K) is large, and j3/j is comparably small. In this limit, 
we expect that the duration of contact becomes unimportant, because the infection is unlikely to cross an 
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Figure 7: Convergence to MFSH. (top) Convergence of the Expected Degree models assuming a uniform 
distribution from to 2 (A) . (bottom) Convergence of the Actual Degree models assuming a third each with 
(A) /2, (K), and 3 (K) /2 stubs. The solid curves correspond to (K) = 1 (top) and (K) = 2 (bottom). The 
dotted curves correspond to various larger values of (K). Because /3 decreases to balance the increase in 
(K), the MFSH curves are the same for all (K). 



edge more than once so it has no way to know how long the edge lasts. Heterogeneity in contact levels may 
still play an important role. If the heterogeneity is sufficiently small, it is reasonable to expect that the MA 
SIR model is appropriate. 



In section 4.2 we rigorously derive the MFSH model from the DC model assuming (A") — > oo with fixed 
(3 (K) and fixed 7. A similar proof will apply for the other models, or we can simply argue that the DC 
model reduces to all of the others, and careful attention to detail shows that they inherit this limiting case. 

3.3.3 Reduction of MFSH to MA model if (A 4 ) / (A) 4 -> 1 and (3 (A) fixed 

It is straightforward to show that the MFSH model becomes SIR model if every node has the same contact 
rate. However more generally, we would expect that if the contact rates are sufficiently close together, the 
model should behave like the SIR model. In fact this holds if (A 4 ) / (A) 4 — s- 1 with /3 (A) fixed. Intuitively, 
if the number of nodes with higher or lower contact rates is very small, their contribution to the spread of 
the disease is not significantly different from the contribution of an average node, and so the disease should 
spread as if the contact rate were homogeneous. We do not have a good intuitive exp lanation for why the 
precise condition relies on (A" 4 ) / (A) 4 . However, we prove it rigorously in section 



4.3 



We expect this case to be particularly relevant if the average degree is large and infectiousness is low. 
Combining this with the previous result, we conclude that all models converge to the MA model under 
appropriate conditions as (A) — > 00. 
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Figure 8: The expected degree distributions for our examples of convergence as (K) — > oo, with and without 
{K 4 )/(K) 4 ^l. 
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Figure 9: Sample convergence as (K) — > oo. We use the exponential distribution described in figure [8] 
The distribution of expected degrees decays exponentially and (K 4 ') / (K) 4 does not approach 1 as (K) 
grows. The solutions appear to converge, but they do not converge to the MA SIR model. 

4 Rigorous proof of convergence to Mass Action equations 

In this section, we give a rigorous proof of one of the most significant results. Namely, if (K) is large, but 
(K 4 } I {K 4 ) is approximately 1 and /3 (K) and 7 are small compared to (K), then regardless of which model 
we use, the result is well-approximated by the MA equations. 

We first show some examples, and then provide the details needed to prove the result for the DC model. 
The simpler models can be derived in the same method (though in some cases the proof will be simpler). 
For our proof, we assume we have a sequence of populations and diseases indexed by n such that (K) —} 00, 
(K 4 ^ j (K) 4 —> 1 with £/3 (K) fixed. We show that this converges to the mass action equations using 
P = tf{K). 

We show this in two steps. First, we show that in the limit of large (K) and constant (K), the DC 
equations converge to the MFSH equations. We then show that the MFSH equations converge to the MA 
equations if additionally (K 4 ) / (K) 4 —> 1. 
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4.1 Example 

As examples, we consider several different population structures using the MP model in figure[8| In the first, 
we take an exponential distribution of expected degrees. To vary the average degree, we change the decay 
rate of the exponential. 

This does not satisfy the conditions that (K 4 ^ / {K) A — > 1. In the other examples, we take a uniform 
distribution, with expected degrees chosen uniformly from ((K) — (K} a , (K) + (K) a ). That is, 

f k<(K)- (K) a 

/>(*) = { w <*> - < K < W + W a 



This satisfies the condition for any a < 1. Using \K ) to denote the average of the 4th power of the 

expected degree 2 we have IkA = (K) 4 + 2 (K) 2+2a + (K) 4a /5. However, the condition fails for a = 1. 

The distribution would not make sense for a > 1 since some nodes would have negative expected degree. 

For all distributions, we take (3 = 2/ (K) and 7 = 1. The corresponding MA model has (3 = 2. In figure[9] 
we plot the results for the exponential distribution. This distribution does not satisfy the conditions, and we 
see that the solutions do converge, but not to the MA model. In figure 10 we take the uniform distribution 
from (K) — (K) a to (K) + (K) a . We again see that when the conditions are not satisfied, the solutions 
may still converge, but not to the corresponding MA model. However, when the conditions are satisfied, the 
solutions converge to the MA model. 



4.2 DC to MFSH 

We will use the following result several times in the proof: Assume g(t) is a function of time and 

g = -c(t)g + f(t) 

If c{t) > C > for all time and |/| < F, and further \g(0)\ < F/C, then for all time \g(t)\ < F/C. To 
see this, assume that / and g are positive. If g is increasing beyond F/C, then —eg + f must be negative, 
violating the assumption that g is increasing. Similar results hold for negative values of / and g. Our proofs 
skip some of the intermediate steps. Complete details of all the algebra is in the Appendix. 

We want to prove that if degrees increase, but simultaneously the infection rate decreases with 7 constant 
then we must arrive at the Mass Action equations. We do this for the Dormant Contact model. We introduce 
the notation o(l) to denote a function whose value goes to zero as (K) —> 00. We will show that neglecting 
small terms in the large degree limit the DC model becomes 

9 = -pec, 

R = 7/ , S = ip(9) , I = l-S-R 

where C = 1 - 9ip'(6)/ip'(l) + (j/(3~)\n6. 

We define C = ^1 + €1 This is the proportion of all stubs which belong to infected nodes. We prove a 
series of results: 

1. 9 = l + o(l). 

2. (j} D =7T + o(l). 

2 Our condition that (ii" 4 ) / (K) 4 — > 1 applies equally for the fourth power of actual or expected degree since at leading order 
they are the same. 
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Figure 10: Sample convergence as (K) —> oo. We use the uniform distribution described in figure [8j In 
the first, second, and third rows, we use a = 1, 0.85, 0.5 respectively. The condition fails for a = 1 and the 
solutions do not converge to the MA model, but they converge to the MA model for smaller a. 

3- *5 = e$g+o(i). 

4. 7rz = 7rC + o(l) & = £C + o(1). 

5. /= £0C + o(l) 

6. = -/36»C + /3o(l). 
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7- C = l-0f^j + (7//3)ln0 + to(l) 
If we drop the error terms in the final two results, then we have the equations governing the Mean Field 
Social Heterogeneity model in the actual degree case, with /3 = /3£ playing the role of the transmission rate. 
The existence of the error terms means that at earlier time, the approximation is better, and it can deviate 
more as time increases. The solution converges uniformly for any t less than any arbitrary chosen value T as 
(K) grows. So for large enough (K), the region over which the MFSH model gives an accurate approximation 
will include the entire period of the epidemic. 

1. = l + o(l). 

We trivially have 1 > > 1 - /?/(/? + 7). Because /3 (K) is constant, we find j3 = o(l). Thus 
1 > 9 > 1 + o(l) (with a negative error term), and so we can conclude that = 1 + o(l). 

2. cj) D = 7r + o(l). We show <p D — 7T = o(l). 

d 



dt 



{<t>D - 7T) = 772(0 - fa)) - Vl4>D 

= 772 [1 + o(l)] - (771 + T] 2 )fa) 

= (m + V2)tt - (vi + m)4>D + 0(1) 
= -(771 + m){ ( i>D - 7r) + 0(1) 

In the next to last step we used the fact that tt = 772/(771 + 772) so 772 = (771 + 772)77. So fa) — tt is at 
most o(l)/(77i + 772) = o(l). 



3. 



We have 



o(l). 



d 

dt 



V'(l) 



-r} 2 fa + Vi — 4>d 



-m<t>s + Vi 7r 



iP'(6) 



no) 
no) 



fa -z 



tA'(I) 

no) 



lP'(l) 



0(1) 



t//(1) ^ V"(0) 

where we use the fact that 7Tg = <p£>ip'(0)/ip'(l) and fa) = 7r + o(l) and that ip' (6) / 'ip' (1) < 1 (since ip 
is a convex function and < 1). Since 77177 = 772^, we have 



d 
dt 



'V'(l) 



-772 



= -\ m 



pfa 



no) 

t//(1) 
<P'(0) 



fa-t 



no) 

7^(0) 

iP'(6) 



no) 



v(i) 



7^(1) 
0(1) 



0(1) 



Since ip" (6) / ip' (0) is nonnegative, the coefficient of fa — £ip' '(0) / 'ip' '(1) in the final expression is at least 
772, so we conclude that 05 = £ip' (9)/ip' (1) + o(l). 

4. 7r / = 7rC + o(l) 6 = eC + o(l). 

We first define q = (-fa)+n6)ip'(6)/ip'(l). Since = l + o(l) and <p D = tt + o(1), wc have -faj + nO = 
o(l). Since < ip'(9) < t//(1), we have q = o(l). We will show that 717 — ir( — q = o(l) 



— (717 - ttC - q) = -tV s - 7717 - ttC - q 



d_ 
"dt 



W) 



77T/ 



d 

"dt 



Tyy(l) 



7C 



So 717 — 7r£ ■ 



>'(1). 
= -7(717 + ttC - g) + o(l) 

<7 = o(l), and 717 = 7r£ + o(l). A similar proof shows that £j 



d 

dt 



(-0/J+7T0) 



t//(0) 
TA'(l) 



€C + o(l). 
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5. 0j=#C + o(l) 

We will actually show that (pi + 4>s — + £,9 2 ip' (9)/ip' (1) + o(l). Once this result is shown, we use 
the fact that (f> s = £ip'(e)/ip'(l) + o(l) = £6>V (^WW + o(l) to get our final result. We begin by 
taking the derivative of 0j + S - £[#C + #V 



d_ 
dt 



V'(i) 



oil/' (8) ■ ip'(O) 



V'(i) 



^'(1) 



After some manipulations (see Appendix) we have 



d_ 
dt 



i/ + <t>s - £ 



9C + e W) 



= -{m + 7) \<i>i + <f>s-£ 



9( + 9 



So (f> I + (j>s-£[6C + 9 2 ip(9)/ip , (l)} = o(l). Because = l + o(l) and </> 5 = ^£>'(0)/y(l) +o(l) it follows 
then that 0/ =£9( + o(l). 

6. = -/36»C + /3o(l). 

This step is trivial. Since <j)i = £0£ + o(l), we have 

9 = -fifa 

= -pec + po(i) 

7 - ( = l-9p$ + ( 1 /P)\n9 + to(l) 
We make the observation that £ 



■^[^'(^/^'(l)] ~ 7C We have 



s dt 



d 
"dt 



#(1) 



^ d 



? ^(g) 
#(1) 



7 



From our previous result we have ( = —[6 + f3o(l)]//36. Substituting this in we have 



dt 



l-^ + (7/W 
= (1) 



7 7/3o(l) 70 
~/30 /§0 /30 



Our result follows immediately. 



So the DC model reduces to the actual degree formulation of the MFSH model. Because the expected 
degree formulation contains the actual degree formulation as a special case, it suffices now to prove that the 
expected degree formulation converges to the MA model in the appropriate limit. 



4.3 MFSH to MA 

Our proof that the MFSH model converges to the MA model does not require that (K) — > oo, but it does 
require (-ft' 4 ) / (-ft") 4 — > 1. However, in order to conclude that the other models approach the MA model, we 
need the conditions of the previous section to hold as well. 

Wc first make the observation that if (-ft" 4 ) / (-ft") 4 — > 1, then (-ft" 2 ) / (-ft") 2 — > 1 as well. By Jensen's 
inequality, we have (K) 2 < (K 2 ) < ^/W). So 1 = (K) 2 / (K) 2 < (K 2 ) / (K) 2 < ^iK*)/(K) 2 = 

\J (-ft" 4 ) / (-ft") 4 — >• 1. Sp (-ft" 2 ) I (-ft") 2 is bounded from below by 1 and from above by something converging 
to 1. 
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We first prove two lemmas. We define s\ and S2 by 



si 



1 *"(8) 



(JO 



5 



We show s\ and S2 go to zero. We have 



lim | s x | = lim 

n— >oo n— >oo 



lim 



*"(e) 



S)( K 2 -(A 2 ))d K 



(see Appendix for details). We now use the Cauchy-Schwarz inequality to bound this. We have 



J- (e -«(i-e)_ 5)(K 2_ (if2))dK 



(KY 



< 



< 



| f °°(e-^-^ - S) 2 p{n) d«| V*| J °°(k 2 - (A 2 )) 2 d«| V* 



\J~{k 2 -{K 2 )) 2 &^/ 2 
(A) 2 



where we use the fact that | exp[— n(l — 8)] — S\ < 1 to show that the first term in the numerator of the first 
equation is at most 1. All that remains is to expand the numerator and bound it. 

|/ oo (^-(^))2 dK |i/ 2 _ \^ { ^-2k 2 (K 2 ) + (K 2 ) 2 )&^/ 2 



(KY 



(A) 4 



(KY 

1/2 



But our assumption is that asn-> oo, both (A 4 ) / (A) 4 and (A 2 ) / (A) 2 go to 1. Thus this goes to zero as 
n — > oo. This completes the proof that Si — ¥ 0. 
To show that S2 — > is similar. 



*2 



1 

(A) 

r c 





*'(8) -S 



J°° ie -«lS)-S)(K-(K))p( K )dK 



< 



< 



(A) 

| /-(e-d-e) - S)p(K) d«|V2| /~(« - (K)) p {k) &k\V 2 



|(A 2 )-(A) 2 |V 2 
(A) 



(A) 



and as n — ¥ oo, this tends to as well. 

We have our bounds on Si and S2, so we are now ready to complete the proof that the Mean Field Social 
Heterogeneity model converges to the Mass Action model as n —> oo. 

We define II/ = -G//3 = 1 — *' (8) /*'(!) — 7(1-0)//?. In 5| we saw that this represents the probability 
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that a new neighbor is infected. We expect 11/ to be very close to I and set y = 11/ — I. Then 



y = Uj-I 

■ V"(Q) 

= ~ 9 W) +76//3-[-S-JZ] 

= - ^r/ + 7©/^ - [-6*'(6) - 7 /] 

= ^-^y ~ tH/ - /3n z *'(e) + 7 i 

= + y)^kf -~iy- W + y)*'( e ) 

= /?<*-) (J + y )[(5 + ai )-(S + a a )]-7i, 
= /?U + y)(si - s 2 ) -7J/ 

We know that I + y is at most 1, and si and s 2 both tend to zero as n increases, so we have y approaches 
— 72/ as n -> oo. Note also that at early time 11/ and / are both very close to so y begins as a very small 
number. It is straightforward to conclude that y can never grow larger than the maximum value of s\ which 
tends to zero asn^-oo. Consequently, y — ¥ as n — > oo. 
We finally have 

S = -/3H/#'(6) 
= -PIS - foS - Pys 2 

Since y and s 2 both tend to as n —¥ oo, we have S = —j3IS. This means that we have the Mass Action 
model in the limit. 



5 Discussion 

We have shown the relations between the various edge-based compartmental models for infectious disease 
spread derived in [5]. We used this to develop a flow chart (figure [T]) which can guide the choice of appropriate 
model for a given population. 

We note that while the edge-based compartmental models allow us to capture effects that were previously 
inaccessible through analytic techniques, there are still many effects that are not captured by the models 
considered here. Our flow chart does not address these. It is always prudent to consider the disease and 
population to ensure that the assumptions of our models are not strongly violated. A number of adaptations 
of the mass action SIR models exist for populations in which the disease has multiple stages, or the population 
has important substructures. In our next paper [6] we show that the general edge-based compartmental 
modeling approach can be used to accomodate many of these effects. 

There is one interesting open question which we call attention to. We showed that if (K) — > oo, 
(K 4 ) I (Ky -> 1 with /3 (K) fixed, the models converge to the Mass Action model. However, there are 

many cases where (if 4 ) / {K) A approaches some other value, and our calculations appear to converge. It 
would be interesting to identify what the relevant reduced equations are in this limit. 
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A Appendix 

In this appendix we show more rigorous arguments underlying many of the claims in the main paper [7] . We 
show that the various edge-based compartmental models presented in 5 a reduce to one another in appropriate 
limits. 

We give one preliminary result that will be used several times. For constant c > 0, if 

X = f(t) — cx 

then so long as |x(0)| < sup t \f(t)\/c, then for all time \x(t)\ < sup 4 f(t)/c. To see this, simply note that 
x < if x > sup |/|/c, and x > if x < — sup |/|/c. From this fact we conclude that x can never increase 
past sup |/|/c or decrease below — sup |/|/c. More generally, if c is not constant, but bounded away from 0, 
then \x(t)\ < sup t /(t)/inf t c(t). 

B Limiting cases of the DVD model 

We begin by considering limiting cases of the DVD model. The equations of the DVD model are 

e = -^e + /?|^+7(i-e) + T 7 ('i-e-^n Ji ) , 
ii R = jU z , n s = *'(e)/#'(i), nj = i-n s -n B) 

R = jl, S = *(©), I = l-S-R. 



B.l Convergence of DVD to MFSH if rj/fi is large 

For a heuristic argument that large values of rj lead to the MFSH model, it is easier if we also assume that 
77/7 S> 1. We do this initially, and then show a more rigorous proof without this additional assumption. 

The variables 6, *'(9)/*'(l) and I - © are all between and 1. We write © = O + ©1/77 H and 

use similar expressions for 11$, Hi, and n^. Since ©, ^ r '(©)/^ ,/ (l), and 1 — are all between and 1, and 
ft and 7 are small compared to 77, the first terms of the O equation are negligible at order 77. At order 77 the 
equation becomes 

= 77 u - e - ^n fl , 

And so at leading order ©0 = 1 — PHr,o/i- Looking at the equation for n# we find II^o = 7^,0 and so 

eo = --n fl) o 
7 

= -/3n J>0 

= -p (1 - n s , - n R , ) 

~/>-/>^ + **-i) 

This is the equation governing the evolution of in the MFSH model. The equations for S, I, and R are 
the same in both models. So if rj/(3 3> 1 and 77/7 1, the DVD model reduces to the MFSH system. 
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To be more rigorous and eliminate the assumption 77/7 3> 1, we should show that the value of the error 
made by using the MFSH equations cannot become large. To do this, we set £1 = 9 — 1 + /3IIr/7, so 
6 = + 1 — (3Hr/7. From the 9 equation wc find 

n-fa = -pe + P^ + ri(i-e-fa 
n-/m I = -i3Q + !3-^-±-rfii 



*'(i) 7 ^ 

f2 = -j3{Q - 1 + IIr) - r)Q 

The value of |0| can never be larger than the maximum value of |/3(9 — 1 + Hr)\. Our assumption is that 
(3 is small compared to 77, so the error in 9 remains small. Thus 6 is approximately 1 — (3Hr/j, and the 
equations we derived above will hold. 



B.2 Convergence of DVD to MP if r}(t - t ) is small 

When r] is small, the term relating to edge swapping becomes small in the 9 equation. Here wc write 
9 = 9o + r/Qi + ■ ■ ■ and similarly expand the other variables. The 9o equation immediately matches the 
equation of the MP model. As long as 7j9i is small, the solution is closely approximated by just solving for 
9 . We find that 

Since 9i is initially 0, we can see that it takes time for r/Qi to grow to an appreciable size (and the time 
is longer for smaller rj). We have not been able to find a similar bound for the error as in the MFSH case, 
and this is because if the epidemic lasts sufficiently long, the creation and deletion of edges will rearrange 
the contact structure during the course of the epidemic, and so the error terms can become significant. 

A more precise bound can be found by asking "how many edges have been created or broken with nodes 
that were no longer susceptible?" As long as the edges being changed are connected to susceptible individuals, 
there is no effect on the spread of the epidemic, but if a node creates a new edge to a recovered rather than 
susceptible individual, this does alter the spread. Early in the epidemic Hi + Hr grows exponentially and 
may be taken to be Ce rt for some r depending on the parameters of the disease and population. The rate 
that edges with these nodes are formed or broken is proportional to rjijij + Hr). Integrating this over time 
we have J_ oo rjCe rT dr — ijCe rt /r = ^(Uj + U R )/r. So a measure of the quality of the approximation is 
whether or not rj{Y\i + Wp)/r is large. This captures the fact that if the edge changeover rate is slow (77 
small), or the timescale of the epidemic is short (r large), or the proportion of the population affected by 
the disease is small (III + Ur small), then the impact of edge creation and deletion is not significant. 
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C Limiting cases of the DFD model 



The equations for the DFD model are 

9 = -Hi , 

ip" (6) 
ip'(6) 
ib" (9) 

4>i = ^i^s^joj + - (P + 7 + V)4>i , 

WW) i 

7TR = 77T/ , ir S = , , , 7T/ = 1 - TT R - TT S , 

V>'(i) 

R = -yI, S(t)=il>(9), I(t) = 1-S-R. 

C.l Convergence of DFD to MFSH if rj/P is large 

As before the simple argument is easier if we also assume that 77/7 is large. Expanding all our variables in 
terms of 1/rj, we see that at leading order 4*1$ = Oirifl- We use this observation to reach two conclusions. 
First 

kr.o = 7717,0 



7- 



so TfRfi = — 7[ln0]//3. We further find 



71 



-/36»(1 - ir s ,o - Tr Rfi ) 

6^(6) 7 ln6» 



-136(1 



^'(1) /3 



and so at leading order in I/77, 9 solves the MFSH equations. 

We can make this more rigorous as we did for the DVD to MFSH case, and we find that we only require 
1)1(3 to be large. check this 



C.2 Convergence of DFD to CM if r}(t - t ) is small 

The argument here is effectively the same as the DVD to MP case. We just make the observation that 
ignoring the r\ term, we find that 4>s = ip' {9) / ip' (1) , so we are able to show that <pi also takes the same 
solution at leading order as in the CM case. Thus 9 solves the same equation as in the CM case and the 
argument is done. 

Again we can find a more precise condition that if 77(717 + nji)/r is small where r is the exponential growth 
rate of the epidemic, then we can neglect the impact of edge swapping. 
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D Limiting cases of the DC model 



The equations for the DC model are 

= -Hi , 



<Ps = ^P^^s^i^ + Vi — Vd ~ V2<Ps , 

0/ = Hi^s^j^ +vi 7 ^r<i>D - (m + P + i)<t>i , 

<pD = V2(6 - 4>d) - f]\4>D , 

£r = -V2^r + VWR + 7& . = (0- 0d)^tt^| , €i = £-£s-tR: 

V>'(i) 

J. ^(0) 

KR = m^R -m^R+ 1^1 ) K S = (pD ~~ 7T7TT , 1TI = IT - ITS - TTR , 

c vi m 

£ = , , = 



vi + V2 m + m 

R = -yI, S = ^(6), I = l-S-R. 

D.l Convergence of DC to DFD as n ->■ 

Since 7r = 772/(7/1 + ry 2 ) , we know that as 7r — >• 0, we must have 772/^1 to 0, and in fact 7r — 772/^1 = 
~V2/( 7 li + ^l 7 /^) <C 7r. So 7r is well-approximated by 772/771. Using this and the equation for <fi D , it is possible 
to show that <f>D — Tt0 at leading order. 

From this we quickly see that its — ir&ip' (8)/ip' (1) = ir£s + 0(ir 2 ). A little more effort shows that 
Tr = tt£i + 0(tt 2 ) and ttr — 7t£r + 0(ir 2 ). We then find at leading order 

ip" (6) 

4>s = -fields + m£,sS - V20S 

h = P<i>i4>s^jjr + V2&0 - (172 + P + i)h 

where we have used the fact that r\\<\>D = V2Q at leading order. 

We can also show that the £ variables in the DC model follow the same equations as the 7r variables 
in the DFD model. Letting rj 2 play the role of 77 n the DFD model, our DC equations reduce to the DFD 
equations. 



D.2 Convergence of DC to DVD as n — > 1 

Consider the variable-degree model with a given p(n). We want to build a dynamic fixed-degree network 
that mimics this. Epidemics on this network may be modeled using the dormant contact equations, but the 
network will behave very much like a variable-degree network. Our approach is as follows: we will take some 
I> 1 and consider the dormant edge model where each node that would have expected degree k has [Lk\ 
stubs (where [-J is the integer part) with the edge swapping occurring in such a way that only a fraction 
1/L of the stubs are active. This is done by having existing edges break at rate 772 = 77 and dormant stubs 
finding neighbors at rate 771 = r)/L. The leading order terms (in L) in the equations of the dormant edge 
model will be the same as the variable-degree model. 

As L — > 00, we anticipate that it = 1 + 0(1/ L) and (pp = 1 + 0(1/ L). We also expect that £ variables 
are all 0(1/ L) and negligible in comparison to all other terms. We set P(k,L) — J^J^ 1 ^ L p(k) dn which 
corresponds to taking all the weight of k and assigning it to [Lk\ . Thus a node that would have expected 
degree k with k < nL < k + 1 instead gets k stubs with k a close approximation to Ln. Define ip(x, L) = 
Y, k x k P(k,L). 
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We set 



Through the substitutions 



Jo 



-k(1— x) 



p(n)dn 



X 



TS = ns7r 



7Tj = il/7r 



772/771 = £ 



V2=V 



and starting from the fixed-degree equations we arrive at the variable-degree equations in the X — >• 00 limit. 
To be fully rigorous, we would define 9, $5, II5, etc. to be the limits as X — > 00. 

To show this, we first show that ?/>(#, X) = *(6) + 0(1/L). Take X > 1 to be given. We have 



-(k+l)/L 



k=0 

00 

= £< 

fc=0 

00 

= E 

fc=0 
/■oo 

JO 

POO 

JO 



J 



p(n) dn 



k/L 
(k+l)/L 



6 Lk P (k) dn 



k/L 

9 Lk P {k) dn + 0(l/L) 
e -«(i-e) p ( K ) d/c + 0(l/L) 



•0(1/L) 



= *(0) + O(l/L) 

Note that d/d9 = (6/L)d/d0, and when X is large, 9 = 1 + 0(1/ L). From this it follows that for n = 1,2, 
ip( n \6,L) = X™*(™)(6) + ©(X"- 1 ). We now take the DC equations and substitute these values for the 
variables. Holding on to just leading order terms in the equations and observing that $5 = ^'(0)/*'(l), we 
see that the variables scale as anticipated, and the equations reduce to the dynamic variable-degree equations 
at leading order. 

D.3 Convergence of DC to MFSH as rji/p — > 00 with 772/^1 fixed 

The argument for this is much the same as in the DFD to MFSH and DVD to MFSH. One modification is 
needed. We note that many edges in the DC model will not be active at any given time: the proportion of 
time an edge is active is £. The appropriate transmission rate in the MFSH model is thus $ = Taking 
the leading order equations in I/771, and following the same basic approach as before, we arrice at the actual 
degree formulation of the MFSH equations. 

E Convergence of MFSH to MA if (K 4 ) / (K) 4 -> 1 

We first make the observation that if (XT 4 ) / (XT) 4 — > 1, then (X' 2 ) / (XT) 2 -+ 1 as well. By Jensen's inequality, 
we have (K) 2 < (K 2 ) < y/(K^. So 1 = (K) 2 / (K) 2 < (Xf 2 ) / (K) 2 < (K)* = ^ (K^) / (K) 4 -> 1. 

Sp (XT 2 ) / (XT) 2 is bounded from below by 1 and from above by something converging to 1. 



26 



Let us assume we have a sequence of populations and diseases (indexed by n) such that 7 and j3 (K) = f3 
arc held fixed. Further assume that (if 4 ) / (if) 4 — > 1 as n — > 00. Wc will prove that the solution of the 
MFSH model converges to the solution of the Mass Action model. 

We first prove two lemmas. We define s\ and s 2 by 

S1 _ 1 s 

1 " (K) *'(1) 
s 2 = T^-tf^GJ-S 



We show si and s 2 go to zero. We have 



lim \si I = lim 



lim 

n— foo 



*"(©) 



= lim 

n— too 



lim 

n— J-oo 



= lim 

n— >oo 



/ °° K 2 e-^-^p{n)dK 



-S 



/ °° K 2 p(n) dn 



(if 2 ) 
J °° k 2 /?(k) d/c 



(KY 



(K 



,\2 



/o°°( 



»/ e -K(i-e) 



S)(k 2 - (K 2 ))&k 



(KY 



where in the next to last step we use (if 2 ) — > (KY , and in the last step we note J(exp[— k(1 — 0)] 
5) (-?f 2 ) dn = 0. We now use the Cauchy-Schwarz inequality to bound this. We have 



r 00 { - 
Jo ( e 



-re(l-e) 



S)(k 2 -(K 2 ))dn 



(KY 



< 



< 



I /"(e-"* 1 - 8 ) - 5) 2 p(«) d«| 1 /2| J~( fc 2 _ ^2^2 d/s |i/ 2 



(if)' 



|Q^-(jg))^cj^ 

(if) 2 



where we use the fact that | exp[— k(1 — 0)] — 5| < 1 to show that the first term in the numerator of the first 
equation is at most 1. All that remains is to expand the numerator and bound it. 

\^{n 2 -(K 2 )) 2 d^/ 2 \^{^-2n 2 (K 2 ) + (K 2 f)d^ 2 



(KY 



- (if 2 ) 2 



(KY 
1/2 



(KY 



But our assumption is that as n — > 00, both (if 4 ) / (if) 4 and (if 2 ) / (if) 2 go to 1. Thus this goes to zero as 
n — >• 00. This completes the proof that si — > 0. 
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To show that s 2 — > is similar. 



S2 



(K) 



*'(0) - S 



/ °° kc-^-^pCk) d« S 1 J °° k>p(k) dn 



(K) 

J-(e-a-e) 



5)(k- (K)) P (k) dn 



< 



< 



< 



(K) 

| /-(e-t^} - S)p(«) d«| V 2 ! / °°(« (K))p( K ) d«|V 2 

(if) 

|(if 2 )-(if) 2 |V 2 



and as n —> oo, this tends to as well. 

We have our bounds on S\ and S2, so we are now ready to complete the proof that the Mean Field Social 
Heterogeneity model converges to the Mass Action model as n — > 00. 

In [5], we showed that IIj = 1 - *'(9)/*'(l) - 7(1 - 6)//3. We expect II/ to be very close to I. Set 
y = IIi - I- Then 



II/ -I 

*"(0) 



-e 
-e 



*'(!) 
tt"(6) 

*"(0) 



/8(/ + »)- 



+ 7 9//3 - [-S - R] 

+ 7 e//3- [-e*'(0)- 7 /] 
- 7 n 7 -/3n/*'(e) + 7/ 
7 y-/3(i + y)*'(e) 



fr"(9) 

(3(K) (I + y)[(S + Sl ) - (S + s 2 ) 
f3(I + y)(s 1 - s 2 ) - 7 y 



72/ 



We know that I + y is at most 1, and si and S2 both tend to zero as n increases, so we have y approaches 
— 77/ as n — > 00. Note also that at early time II/ and I are both very close to so y begins as a very small 
number. It is straightforward to conclude that y can never grow larger than the maximum value of s\ which 
tends to zero as n — > 00. Consequently, y — ¥ as n — > 00. 
We finally have 

S = -/3H/*'(e) 

= -^n J (S , + a 2 ) 

= -pIS - $yS - $ys 2 



Since y and s 2 both tend to as n — ¥ 00, we have S 
model in the limit. 



-0IS. This means that we have the Mass Action 
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F Dormant Contact model to Mean Field Social Heterogeneity 
model as (K) — > oo, /3 = with j3 (K), 7 constant 

We want to prove that if degrees increase, but simultaneously the infection rate decreases with 7 constant 
then we must arrive at the Mass Action equations. We do this for the Dormant Contact model. We introduce 
the notation o(l) to denote a function whose value goes to zero as (K) — > 00. We will show that neglecting 
small terms in the large degree limit the DC model becomes 

9 = -m 

R = jl, S = il>(9), I = l-S-R 

where C = 1 - 6>V'((9)/V>'(1) + (7//?) hi (9. 

To be more rigorous, we define ( — 717 + £/ This is the proportion of all stubs which belong to infected 
nodes. We prove a series of results: 

1. 9 = l + o(l). 

2. 4> D =7T + 0(1). 

3- <t>S=Zp$+o(l). 

4. 7rj = 7rC + o(l) 6 = eC + o(l). 

5. 0j=#C + °(l) 

6. e = -jies + po{\). 

7- t = l-6p$ + ( 7 /P)]n9 + to(l) 

If we drop the error terms in the final two results, then we have the equations governing the Mean Field 
Social Heterogeneity model in the actual degree case, with /3 = /?£ playing the role of the transmission rate. 
The existence of the error terms means that at earlier time, the approximation is better, and it can deviate 
more as time increases. The solution converges uniformly for any t less than any arbitrary chosen value T as 
(K) grows. So for large enough (K), the region over which the MFSH model gives an accurate approximation 
will include the entire period of the epidemic. 

1. 6 = l + o(l). 

We trivially have 1 > 9 > 1 — /3/(/3 + 7). Because (3 (K) is constant, we find /? = o(l). Thus 
1 > 9 > 1 + o(l) (with a negative error term), and so we can conclude that 9 = 1 + o(l). 

2. (f> D =7T + o(l). 

We seek to prove that (f)D is approximately 7r. The way we will do this is to prove instead that 
4>d — = o(l). Since 4>d = tt before the disease is introduced, the error never has a chance to be larger 
than o(l). We will use this approach for most of our remaining results. 

^(0D - 71") = 772(0 - </>d) - Vl^D 

= 772 [1 + O(l)] - (?7l + 7] 2 )<j) D 

= (t?1 + 772)7T - (771 + 1] 2 )4>D + 0(1) 

= -(vi + m){(t>D - 7r) + o(l) 

In the next to last step we used the fact that w = 772/(771 + 772) so 772 = (77! + 772)71-. So 4>d — ^ is at 
most o(1)/(t7i + 772) = o(l). 
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>V'(i) 
We have 



d_ 
dt 



= (f>s-& 



no) 

= -f]i4>s + m — 4>d 



no) 



= ~V2<f>S + Vl 



= -m4>s + vi n 



4i tm 



^'(1) 



- P<f>i 



no) 



4>s 



V'(i) 



<t>s 



4>s 



V(i)y 



0(1) 



where we use the fact that 4>d = tt + o(l) and that tp' (9) / ip' (1) < 1 (since ip is a convex function and 
9 < 1). Since r/in = r] 2 ^, we have 



d_ 
dt 



<t>s -a 



0s 



0(1) 



= - % 



Since ip" (9) / tp' (9) is nonnegative, the coefficient of 0s — £,ip' (9) / ip' {1) in t ne nna l expression is at least 
?72, so we conclude that <p s — £ip' (9) / ip' (1) + o(l). 

7rj = 7rC + o(l) & = & + o(l). 

We first define q = (-(j> D + n9)ip'(9)/ip'(l). Since 9 = l + o(l) and <p D = 7r + o(l), we have -<f> D + Tr9 = 
o(l). Since < ip'(9) < ip'(l) (V 7 is a convex function), we have q = o(l). Wc will show that 
ttj — 7r£ — q = o(l) 

^(tt 7 - ttC - q) = -n s - 7 71 "/ ~ < - <? 
d 

= -7(tt/ + ttC - g) + o(l) 

So 7T/ — 7rC — q = o(l), and 717 = 7r£ + o(l). A similar proof shows that £/ = ££ + o(l), or we can simply 
use 

6 = C - TTJ 

= C-7TC + 0(1) 

= ec + o(i) 

0j=#C + o(l) 

To show this, we will actually show that 0/ + 05 = + ^9 2 tp'(9)/ip'(l) + o(l). In doing so we take 
advantage of the fact that multiplying by 9 behaves very much like multiplying by 1. Once this result 
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is shown, we use the fact that 4> s = £ip'(9)/ip'(l) + o(l) = £9 2 ip'(9)/ip'(l) +o(l) to get our final result. 
We begin by taking the derivative of (f>i + <j) S - £[9( + 9 2 ip'(9)/ip'(l)] 



d_ 
dt 



6( + 6 



2 ^(g) 
V>'(1) 



= 4>i + (j>s - £, 



+ 2(9(9 



^'(g) 



V>'(1) 



-??2(0/ + 0s) + »7i — (tts + ttj) - (/? + 7)0/ 



32 /(g)\ 



= -%(0/ + 0s) + Vi (^^jjy + <) - (Z 3 + 



+ £g 



+ 7 C -W, 



0'(g) 
0'(1) 

0'(g) 



+£g 



-% I 0/ + 0s-£^ 



V>'(g) 
V>'(1) 

+^(c + ^) 



+£g 



at v'(i) 

W + 7)0/ 
d 

dt V'(l) 
/?0/ - 7(0/ - £flC) 



+ 7 C -2Wj 



0'(1) 

^(g) 
^'(1) 



+ o(l) 



+ o(l) 



dt>(i); 



/ ^(T) +o(1) 



= (0/ + 0s - - ec) - ^0/ - 7(0/ - m 



+ ^0/C + W/g 2 ^| 



C/30/g 



= -m (in + 05 - - ft) - 7(0/ - £gC) 

4-^^"W ^^>"(g)V m 

Where in the last step we have used the fact that /3 — o(l) and so anything which is bounded and 
multiplied by (3 is also o(l). Continuing we have 



d_ 
dt 



n + 4>s-i 



9( + 6 



2 ^(g) 
V>'(1) 



% (0/ + 05-C7^-eC) -7^/-^0+o(l) 



( i>' 
-m l^i + 0s - ?^ 



V'(i) 

^(g) 
V'(i) 



(?72 + 7) ( 0/ + 0S - £ 



^c + g 



7(0/-^C + 0s-^)+o(l) 



2 0'(g) 
0'(1) 



+ 0(1) 



[in the first step we used the fact that 0s = ^'(0)/-0'(l)+o(l)]. So t/> / +0 s -£[gC+g 2 V'(g)/V / (l)] = o(l). 
Because = 1 + o(l) and </> s = ^'{9)/^'{l) + o(l) it follows then that 0/ = £6>C + o(l). 

6. 9 = -pet + po(i). 

This step is trivial. Since <j)i = £9Q + o(l), we have 

g = -/?0/ 
= -^gC + /3o(l) 

= -£0C + /?o(l) 
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7- C = l-4$ + (7/i8)ln0 + to(l) 

We make the observation that C = -^ t [0ip'{0)/ip'(l)] - 7C We have 



s dt 



1 - 



( 7 //3)ln( 



d 
"dt 



#(1) 



^ d 
^ C+ d7 



/W 



70 

10 



= -7C - — 



From our previous result we have £ = — [0 - 



s dt 

Our result follows immediately. 



/W 



/3o(l)]/(38. Substituting this in we have 

7<9 7/3o(l) 7 
__ 



(7/0) In* 
= o(l) 



/?0 00 



Using the final two results, we have the equations of the MFSH model, with j3 — /3£ playing the role 
of the transmission rate. This ultimately leads to the MA model with j3 — ft (K) if the conditions of the 
previous section hold. 
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